Feature gene selection method based on logistic and correlation information entropy
نویسندگان
چکیده
منابع مشابه
Feature Selection Based on Mutual Correlation
Feature selection is a critical procedure in many pattern recognition applications. There are two distinct mechanisms for feature selection namely the wrapper methods and the filter methods. The filter methods are generally considered inferior to wrapper methods, however wrapper methods are computationally more demanding than filter methods. A novel filter feature selection method based on mutu...
متن کاملInformation-based Feature Selection
Feature selection is a topic of great interest in applications dealing with high-dimensional datasets. These applications include gene expression array analysis, combinatorial chemistry and text processing of online documents. Using feature selection brings about several advantages. First, it leads to lower computational cost and time. Less memory is needed to store the data and less processing...
متن کاملLogistic Regression Models for a Fast CBIR Method Based on Feature Selection
Distance measures like the Euclidean distance have been the most widely used to measure similarities between feature vectors in the content-based image retrieval (CBIR) systems. However, in these similarity measures no assumption is made about the probability distributions and the local relevances of the feature vectors. Therefore, irrelevant features might hurt retrieval performance. Probabili...
متن کاملFeature Selection Based on Information Theory
Feature selection is an essential component in all data mining applications. Ranking of futures was made by several inexpensive methods based on information theory. Accuracy of neural, similarity based and decision tree classifiers calculated with reduced number of features. Comparison with computationally more expensive feature elimination methods was made.
متن کاملFeature Selection based on Information Gain
The attribute reduction is one of the key processes for knowledge acquisition. Some data set is multidimensional and larger in size. If that data set is used for classification it may end with wrong results and it may also occupy more resources especially in terms of time. Most of the features present are redundant and inconsistent and affect the classification. In order to improve the efficien...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bio-Medical Materials and Engineering
سال: 2015
ISSN: 1878-3619,0959-2989
DOI: 10.3233/bme-151498